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Abstract 

Define  a*,  to  be  the  smallest  positive  multiple  of  k  such  that  the  sum  of  its  digits  in 
base  q  is  equal  to  k.  The  asymptotic  behavior,  lower  and  upper  bound  estimates  of  ak 
are  investigated.  A  characterization  of  the  minimality  condition  is  also  considered. 


1  Motivation 


A  positive  integer  n  is  a  Niven  number  (or  a  Harshad  number )  if  it  is  divisible  by  the  sum  of 
its  (decimal)  digits.  For  instance,  2007  is  a  Niven  number  since  9  divides  2007.  A  q-Niven 
number  is  an  integer  k  which  is  divisible  by  the  sum  of  its  base  q  digits,  call  it  sq(k)  (if 
q  =  2,  we  shall  use  s(k)  for  S2(k)).  Niven  numbers  have  been  extensively  studied  by  various 
authors  (see  Cai  [3],  Cooper  and  Kennedy  [4],  De  Koninck  and  Doyon  [5],  De  Koninck, 
Doyon  and  Katai  [6],  Grundman  [7],  Mauduit,  Pomerance  and  Sarkozy  [11],  Mauduit  and 
Sarkozy  [12],  Vardi  [16],  just  to  cite  a  few  of  the  most  recent  works). 

In  this  paper,  we  define  a  natural  sequence  in  relation  to  g-Niven  numbers.  For  a  fixed 
but  arbitrary  k  E  N  and  a  base  q  >  2,  one  may  ask  whether  or  not  there  exists  a  g-Niven 
number  whose  sum  of  its  digits  is  precisely  k.  We  will  show  later  that  the  answer  to  this  is 
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affirmative.  Therefore,  it  makes  sense  to  define  a\~  to  be  the  smallest  positive  multiple  of  k 
such  that  Sg(afc)  =  k.  In  other  words,  is  the  smallest  Niven  number  whose  sum  of  the 
digits  is  a  given  positive  integer  k.  We  denote  by  Ck  the  companion  sequence  Ck  =  dk/k , 
k  G  N.  Obviously,  a*,,  respectively,  Ck,  depend  on  q.  but  we  will  not  make  this  explicit  to 
avoid  cluttering  the  notation. 

In  this  paper  we  give  constructive  methods  in  Sections  3,  4  and  7  by  two  different 
techniques  for  the  binary  and  nonbinary  cases,  yielding  sharp  upper  bounds  for  a&.  We  find 
elementary  upper  bounds  true  for  all  k,  and  then  better  nonelenrentary  ones  true  for  most 
odd  k. 

Throughout  this  paper,  we  use  the  Vinogradov  symbols  2>  and  <C  and  the  Landau 
symbols  O  and  o  with  their  usual  meanings.  The  constants  implied  by  such  symbols  are 
absolute.  We  write  x  for  a  large  positive  real  number,  and  p  and  q  for  prime  numbers.  If 
A  is  a  set  of  positive  integers,  we  write  A(x)  =  yin  [l,x].  We  write  In x  for  the  natural 
logarithm  of  x  and  logx  =  maxjlnx,  1}. 

2  Easy  proof  for  the  existence  of  ap 

In  this  section  we  present  a  simple  argument  that  shows  that  the  above  defined  sequence  ak 
is  well  defined.  First  we  assume  that  k  satisfies  gcd (k,q)  =  1.  By  Euler’s  theorem,  we  can 
find  an  integer  t  such  that  qt  =  1  (mod  k),  and  then  define  K  =  1  +  qt  +  q2t  +  •  •  •  +  gC*-1)* . 
Obviously,  K  =  0  (mod  k),  and  also  sq(I\ )  =  k.  Hence,  in  this  case,  K  is  a  Niven  number 
whose  digits  in  base  q  are  only  0’s  and  l’s  and  whose  sum  is  k. 

If  k  is  not  coprime  to  q,  we  write  k  =  ab  where  gcd (b,  q)  =  1  and  a  divides  qn  for  some 
n  G  N.  As  before,  we  can  find  K  =  0  (mod  b)  with  sq{K )  =  b.  Let  u  =  max{n,  [log9  K~\  }+l, 
and  define  K'  =  (qu  +  q2u  +  •  •  •  +  qua)K.  Certainly  k  =  ab  is  a  divisor  of  K'  and  sq{K')  = 
ab  =  k.  Therefore,  ak  is  well  defined  for  every  k  G  N. 

This  argument  gives  a  large  upper  bound,  namely  of  size  exp(0(fc2))  for  ak- 

We  remark  that  if  m  is  the  minimal  g-Niven  number  corresponding  to  k,  then  q  —  1 
must  divide  m  —  sq(m)  =  kck  —  k  =  ( Ck  —  1)£;.  This  observation  turns  out  to  be  useful  in 
the  calculation  of  Ck  for  small  values  of  k.  For  instance,  in  base  ten,  the  following  table  of 
values  of  ak  and  Ck  can  be  established  easily  by  using  the  previous  simple  observation.  As 
an  example,  if  k  =  17  then  9  has  to  divide  C17  —  1  and  so  we  need  only  check  10, 19,  28. 
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k 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

Cfc 

19 

19 

4 

19 

19 

13 

28 

28 

11 

46 

199 

19 

109 

73 

ak 

190 

209 

48 

247 

266 

195 

448 

476 

198 

874 

3980 

399 

2398 

1679 

3  Elementary  bounds  for  in  the  binary  case 

For  each  positive  integer  k  we  set  n k  =  [ log2  k~\  •  Thus,  nk  is  the  smallest  positive  integer 
with  k  <  2Uk.  Assuming  that  k  £  N  {k  >  1)  is  odd,  we  let  tk  be  the  multiplicative  order  of 
2  modulo  k,  and  so,  2tk  =  1  (mod  k).  Obviously,  tk  >  rik  and  tk  \  where  <p  is  Euler’s 

totient  function.  Thus, 

nk  <tk  <  k  -  1.  (1) 

Lemma  1.  For  every  odd  integer  k  >  1,  every  integer  x  G  {0, 1, . . . ,  k  —  1}  can  be  repre¬ 
sented  as  a  sum  modulo  k  of  exactly  nk  distinct  elements  of 

D  =  {2*  \  i  =  0, . . .  ,nk  +  k  —  2}. 

Proof.  We  find  the  required  representation  in  a  constructive  way.  Let  us  start  with  an 
example.  If  x  =  0  and  k  =  2Uk  —  1,  then  since  x  =  k  (mod  k),  we  notice  that  in  this 
case  we  have  a  representation  as  required  by  writing  k  =  1  +  2  +  •  •  •  +  2nk~ 1  (note  that 
nk  —  1  <  nk  +  k  —  2  is  equivalent  to  k  >  1). 

Any  x  G  {0, 1, . . . ,  k  —  1}  has  at  most  nk  bits  of  which  at  most  nk  —  1  are  ones.  Next,  let 
us  illustrate  the  construction  when  this  binary  representation  of  x  contains  exactly  nk  —  1 
ones,  say 

x  =  2nfc_1  +  2nfc~2  +  •  •  •  +  2  +  1  —  2\  for  some  j  G  {0, 1, . . . ,  nk  —  1}. 

First,  we  assume  j  <nk  —  2.  Using  2-7+1  =  2-?  +  2J  =  2-7  +  2^+tk  (mod  k),  we  write 

x  =  2j+tk  +  2nk~1  + - b  2j+2  +  2j  +  2j~ 1  + - f  1  (mod  k), 

where  both  j  +  tk  <  nk  —  2  +  k  —  1  =  nk  +  k  —  3  and  j  +  tk  >  nk  —  1  are  true  according  to  (1) . 
Therefore  all  exponents  are  distinct  and  they  are  contained  in  the  required  range,  which 
gives  us  a  representation  of  x  as  a  sum  of  exactly  nk  different  elements  of  D  modulo  k. 

If  j  =  nk  —  1,  then  x  =  2nfe^1  —  1.  We  consider  x  +  k  instead  of  x.  By  the  definition 
of  nk,  we  must  have  k  >  2nk~l  +  1.  Hence,  x  +  k  >  2nk ,  which  implies  that  the  binary 
representation  of  x+k  starts  with  2nk  and  it  has  at  most  nk  ones.  Indeed,  if  s(x+k)  >  nk+ 1, 
then  x  +  k  >  2nk  +  2nk~l  +  ■  ■  ■  +  2  +  1  =  2nfe+1  —  1,  which  in  turn  contradicts  the  inequality 
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x  +  k  <  k  —  1  +  k  =  2k  —  1  <  2nk+l  —  3  since  k  is  odd.  If  s(x  +  k)  =  rik ,  then  we  are 
done  (k  >  3).  If  s(x  +  k)  =  rik  —  1,  then  we  proceed  as  before  and  observe  that  this  time 
j  +  tk  <  rik  +  k  —  2  for  every  j  £  {0, 1, 2, ... ,  nk  —  1}  and  j  +  tk  >  nk  if  j  >  0  which  is  an 
assumption  that  we  can  make  because  in  order  to  obtain  nk  —  1  ones  two  of  the  powers  of 
2,  out  of  1,  2, 22, . . . ,  2nk~1,  must  be  missing. 

If  s(x  +  k)  <  nk  —  1,  then  for  every  zero  in  the  representation  of  x  +  k,  which  is 
preceded  by  a  one  and  followed  by  £  {£  >  0)  other  zeroes,  we  can  fill  out  the  zeros  gap  in 
the  following  way.  If  such  a  zero  is  given  by  the  coefficient  of  2J,  then  we  replace  2J+1  by 
2]  +  2J_1  +  •  •  •  +  +  2i~e+tk .  This  will  give  £  +  2  ones  instead  of  a  one  and  7  +  1  zeros.  We 

fill  out  all  gaps  this  way  with  the  exception  of  the  gap  corresponding  to  the  smallest  power 
of  2  and  £>  1,  where  in  order  to  insure  the  inequality  j'  +  tk  >  nk  {j'  =  j  —  l  +  1  >  0)  one 
will  replace  2-?+1  by  2J  +  2-?-1  +  •  •  •  +  2J~^+1  +  2J_^+1+tfe .  The  result  will  be  a  representation 
in  which  all  the  additional  powers  2^'+tk  will  be  distinct  and  the  total  number  of  powers  of 
two  is  nk ■  The  maximum  exponent  of  these  powers  is  at  most  j'  +  tk  <  nk  +  k  —  2. 

If  the  representation  of  x  starts  with  2nk~1,  then  the  technique  described  above  can  be 
applied  directly  to  x  making  sure  that  all  zero  gaps  are  completely  filled.  Otherwise,  we 
apply  the  previous  technique  to  x  +  k.  □ 

Example  2.  Let  k  =  11.  Then  tin  =  4  and  tu  =  10.  Suppose  that  we  want  to  represent  9 
as  a  sum  of  4  distinct  terms  modulo  11  from  the  set  D  =  { 1,2, .. .  ,213}.  Since  9  =  23  +  1, 
we  have  9  =  22 +  2  +  2  +  1,  so  9  =  22  +  2  +  211  +  1  (mod  11).  If  we  want  to  represent 
7  =  22  +  21  +  2°  then,  since  this  representation  does  not  contain  23,  we  look  at  7+11  = 
18  =  24  +  2  =  23  +  23  +  2  =  23  +  22  +  22  +  2.  Thus,  7  =  23  +  22  +  212  +  2  (mod  11). 

We  note  that  the  representation  given  by  Lemma  1  is  not  unique.  If  this  construction  is 
applied  in  such  a  way  that  the  zero  left  when  appropriate  is  always  the  one  corresponding 
to  the  largest  power  of  2,  we  will  obtain  the  largest  of  such  representations.  In  the  previous 
example,  we  can  fill  out  the  smallest  gap  first  and  leave  a  zero  from  the  gap  corresponding 
to  23,  so  7  =  18  =  24  +  2  =  23  +  23  +  1  +  1  =  23  +  213  +  1  +  210  (mod  11). 

Recall  that  2“||m  means  that  2“  |  m  but  2“+1  {  m.  We  write  //2 (m)  for  the  exponent  a. 

Theorem  3.  For  all  positive  integers  k  and  £,  there  exists  a  positive  integer  n  having  the 
following  properties: 


(a)  s(nk )  =  £k, 

(, b )  n  <  (2lk+nk  -  2^)/k. 
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1  and 


Proof.  It  is  clear  that  if  A:  is  a  power  of  2,  say  k  =  2s,  then  we  can  take  n  =  2^k  — 
so  s(kn )  =  s(2s  +  2S+1  +  •  •  •  +  2s+tt:~1)  =  £k.  In  this  case,  the  upper  bound  in  part  (6)  is 
sharp  since  nk  =  s  =  /i2 {k). 

Furthermore,  if  k  is  of  the  form  k  =  2 md  for  some  positive  integers  m,  d  with  odd  d  >  3, 
then  assuming  that  we  can  find  an  integer  n  <  (2 2m&z+n/c'  _  fd,  where  nk  =  flog2  d\ ,  such 
that  s(nd)  =  2 m£d,  then  nk  satisfies  condition  (a)  since  s(nk)  =  s(2mnd)  =  s(nd)  =  2 m£d  = 
Ik.  We  observe  that  condition  (6)  is  also  satisfied  in  this  case,  because  (22'n(d+nk  —  1  )/d  = 
(2 ek+nk  -  2m) /k. 

Thus,  without  loss  of  generality,  we  may  assume  in  what  follows  that  k  >  3  is  odd. 
Consider  the  integer  M  =  2ek+rtk  —  1  =  1  +  21  +  --  -  +  2^c+"fc~1,  and  so,  s(M)  =  £k  +  n^. 
By  Lemma  1,  we  can  write 

M  =  2jl  +2j2  +  ■■■  +  2jnk  (mod  k ) ,  (2) 

where  0  <  j\  <  j2  <  ■  ■  ■  <  jn  <  k  +  —  2  <  £k  +  nk  —  1.  Therefore,  we  may  take 

M  -  (2^+2^  + - b2>u-) 

n=  - ~k - ’ 

which  is  an  integer  by  (2)  and  satisfies  s(nk)  =  s(M  —  (2J1  +  2-72  +  •  •  •  +  2Jnfc))  =  Afc.  □ 
Corollary  4.  TTie  sequence  (ak)k> i  satisfies 

2k  -l  <ak<  2k+nk  -  2^k\  (3) 

Proof.  The  first  inequality  in  (3)  follows  from  the  fact  that  if  s(ak)  =  k,  then  ak  >  1  +  2  + 
•  •  •  +  2fc_1  =  2k  —  1.  The  second  inequality  in  (3)  follows  from  Theorem  3  by  taking  £  =  1, 
and  from  the  minimality  condition  in  the  definition  of  a^.  □ 

We  have  computed  ak  and  ck  for  all  k  =  1, . . . ,  128, 

ci  =  1,  c2  =  3,  c3  =  7, . . . ,  c20  =  20  971  5, .... 

and  the  graph  of  k  — >  ln(cfc)  against  the  functions  k  — >  ln(2fc)  and  k  ln(2fc  —  1)  —  ln(A)  is 
included  in  Figure  1. 

The  right  hand  side  of  inequality  (3)  is  sharp  when  k  =  2s,  as  we  have  already  seen.  For 
k  =  2s  —  1,  we  get  values  of  ck  very  close  to  2k  —  1  but,  in  general,  numerical  evidence  shows 
that  ck/2k  is  closer  to  zero  more  often  than  it  is  to  1.  In  fact,  we  show  in  Section  6  that  this 
is  indeed  the  case  at  least  for  odd  indices  (see  Corollary  11,  Corollary  12  and  relation  (23)). 
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Figure  1:  The  graphs  of  k  — »  ln(cfc)  and  k  — >  ln(2fc),  k  — > ln(2fc  —  1)  —  ln(fc) 

4  Improving  binary  estimates  and  some  closed  formulae 

In  order  to  obtain  better  bounds  for  a*,,  we  introduce  the  following  classes  of  odd  integers. 
For  a  positive  integer  m  we  define 

m 

Cm  =  {k=  1  (mod  2)|  2k+m  —  1  =  ^  2Ji  (mod  k),  for  0  <  j\  <  j?  <  •  •  •  <  jm  <  m  +  k  —  2}. 

i=l 

Let  us  observe  that  Cm  C  Cm+ 1.  Indeed,  if  /c  is  in  Cm,  we  then  have  2fc+m  — 1  =  2J1  +  -  •  •+2-?m 
for  some  0  <  j\  <  ji  <  ■  ■  ■  <  jm  <  m  +  k  —  2.  Multiplying  by  2  the  above  congruence  and 
adding  one  to  both  sides,  we  get  2k+m+l  —  1  =  1  +  2J1+1  +  •  •  •  +  2Jm+1 ,  representation  which 
implies  that  k  belongs  to  Cm+i.  Note  also  that  Lemma  1  shows  that  every  odd  integer  k  >  3 
belongs  to  Cu,  where  u  =  [ log  k / log  2] .  Hence,  we  have  2N  +  1  =  UmeN^m- 

Theorem  5.  For  every  k  £  C\,  vue  have 

2k  -  1  <  ak  <  2k+1  -  1. 

In  particular,  Ck/2k  — >  0  as  k  — ►  oo  through  C\.  Furthermore,  ak  =  2k+l  —  1  —  2JI ,  where 
ji  =  jo  +  st/c,  with  s  =  [(fc  —  1  —  jo)AfcJ;  and  0  <  jo  <  tfc  —  1  is  such  that  2k+1  —  1  =  2-70 
(mod  fe). 

Proof.  We  know  that  2fc  —  1  ^  0  (mod  /c)  (see  [13,  Problem  37,  p.  109]).  Hence,  an  integer 
of  binary  length  k  whose  sum  of  digits  is  k  is  not  divisible  by  k.  Therefore,  ak  >  2k  —  1. 
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Next,  we  assume  that  ak  is  an  integer  of  binary  length  k  + 1  and  sum  of  digits  k;  that  is, 
ak  =  2k+l  —  1  —  2J  for  some  j  =  0, . . . ,  k  —  1.  But  2k+1  —  1  =  x  (mod  k),  and  by  hypothesis 
there  exists  jo  such  that  x  =  2J0  for  some  jo  G  {0, . . .  ,tk  —  1}.  In  order  to  obtain  ak,  we 
need  to  subtract  the  highest  power  of  2  possible  because  of  the  minimality  of  a*,.  So,  we 
need  to  take  the  greatest  exponent  j\  =  jo  +  stk  <  k  —  1,  leading  to  s  =  |_( k  —  1  —  jo)/tk\. 
Hence,  ak  =  2k+1  -  1  -  2-h .  □ 

Based  on  the  above  argument,  we  can  compute,  for  instance,  05  =  55  =  26  —  1  —  23, 
since  23  —  1  =  23  (mod  5).  Similarly,  029  =  230  —  1  —  25  =  1073741791,  since  230  —  1  =  25 
(mod  29),  and  025  =  226  —  1  —  219  =  665845  75,  since  226  —  1  =  219  (mod  25),  or  perhaps  the 
more  interesting  example  0253  =  2254  —  1  —  2242. 

Theorem  6.  If  m  G  N,  and  k  G  Cm+ 1  \  Cm,  we  then  have 

2k+m~ 1  -  1  <ak<  2k+m  -  1. 

Thus,  ck/2k  — >  0  as  k  — >  00  in  Crn  for  any  fixed  m. 

Proof.  Similar  as  the  proof  of  Theorem  5.  □ 

Theorem  7.  For  all  integers  k  =  2*  —  1  >  3,  we  have 

ak  <  2k+k~  +  2k  -  2k~i  -  1,  (4) 

where  k~  is  the  least  positive  residue  of  —k  modulo  i.  Furthermore,  the  bound  (4)  is  tight 
when  k  =  2l  —  1  is  a  Mersenne  prime.  In  this  case,  we  have  ck/2k  — >  1/2  as  k  — >  00  through 
Mersenne  primes,  assuming  that  this  set  is  infinite. 


Proof.  For  the  first  claim,  we  show  that  the  sum  of  binary  digits  of  the  bound  of  the  upper 
bound  on  (4)  is  exactly  k,  and  also  that  this  number  is  a  multiple  of  k.  From  the  definition 
of  k~ ,  we  find  that  k  +  k~  =  ia  for  some  positive  integer  a.  Since 


2  k+k 


+  2k  -  2k~l  -  1 


2k~i(2i  -  1)  +  2ia  -  1 

(2*  -  l)(2fc_*  +  2i{-a~l)  +  2*(a~2)  + - hi), 


we  get  that  2k+k  +  2k  —  2k  1 


1  is  divisible  by  k.  Further,  k  >1  since  k  is  not  divisible 
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by  i  (see  the  proof  of  Theorem  5),  and 


s  (2k+k~  +  2k  -  2k~i  -  l)  =  s(2k+k~~1  + - b2+l  +  2fe  —  2fc~*) 

=  s(2k+k~~1-\ - b  2k  H - b  2fc_i  H - b  2  +  1  + 

=  s  (2k+k~  +  2fc_1  H - b  2fc-*  H - b  2  +  l) 

=  k, 

where  t  means  that  t  is  missing  in  that  sum.  The  first  claim  is  proved. 

We  now  consider  a  Mersenne  prime  k  =  2*  —  1.  First,  we  show  that  k  £  Cj  \  Cj_ i .  Since 
u  =  [log  k/  log  2]  =  i,  by  Lemma  1,  we  know  that  k  £  Cj.  Suppose  by  way  of  contradiction 
that  k  £  Cj_i.  Then 

2k+i- 1  -  1  =  2J1  +  •  •  •  +  2*"1  (mod  jfe)  (5) 

holds  with  some  0  <  ji  <  j2  <  ■  ■  ■  <  ji-i  <  k  +  i  —  3.  Since  k  is  prime,  we  have  that 
2fc_1  =  1  (mod  A:),  and  so  2fc+*_1  —  1  =  2*  —  1  =  0  (mod  k ). 

Because  2*  =  1  (mod  /c),  we  can  reduce  all  powers  2J  of  2  modulo  k  to  powers  with 
exponents  less  than  or  equal  to  i  —  1.  We  get  at  most  i  —  1  such  terms.  But  in  this  case,  the 
sum  of  at  least  one  and  at  most  i  —  1  distinct  members  of  the  set  {1,2,...,  2*~1}  is  positive 
and  less  than  the  sum  of  all  of  them,  which  is  k.  So,  the  equality  (5)  is  impossible. 

To  finish  the  proof,  we  need  to  choose  the  largest  representation  x  =  2n  +  •  •  •  +  2Ji, 
with  0  <  j\  <  j2  <  ■  ■  ■  <  ji  <  k  +  i  —  2,  such  that  2k+l  —  1  =  x  (mod  k).  But  2k+t  —  1  = 
2*+1  —  1  =  1  (mod  k).  Since  the  exponents  j  are  all  distinct,  the  way  to  accomplish 
this  is  to  take  jt  =  k  +  i  —  2,  jj_i  =  k  +  i  —  3, . . . ,  j2  =  k,  and  finally  j i  to  be  the 
greatest  integer  with  the  property  that  the  resulting  x  satisfies  x  =  1  (mod  k).  Since 

x  =  2-h  +  2fc(l  +  2  + - b  2*“2)  =  2h  +  2fc(2?-1  -  1)  =  2 h  +  2i  -  2  =  2^  -  1  (mod  fc), 

we  need  to  have  2J1  =  2  (mod  k).  Since  the  multiplicative  order  of  2  modulo  k  is  clearly 
i,  we  have  to  take  the  largest  j\  =  1  +  si  such  that  1  +  si  <  k.  But  i  must  be  prime  too 
and  so  2l~1  =  1  (mod  i).  This  implies  k  =  2*  —  1  =  1  (mod  i).  Therefore  ji  =  k  —  i.  So, 
ak  =  2k+i  -  1  -  x  =  2k+i  -  1  -  2k~i  -  2k+i~1  +  2k  =  2fc+i"1  +  2k  -  2k~{  -  1  and  the  inequality 
given  in  our  statement  becomes  an  equality  since  k~  =  i  —  1  in  this  case. 

Regarding  the  limit  claim,  we  observe  that 

ck  _k  + 11  1  1  1 

2k~  2k  +k~k2~i~k2k  '  2’ 

as  i  (and  as  a  result  k )  goes  to  infinity.  □ 


Between  the  two  extremes,  Theorems  6  and  7,  we  find  out  that  the  first  situation  is 
more  predominant  (see  Corollary  12).  Next,  we  give  quantitative  results  on  the  sets  Crn . 
However  we  start  with  a  result  which  shows  that  C\  is  of  asymptotic  density  zero  as  one 
would  less  expect. 

5  C\  is  of  density  zero 

Here,  we  show  that  C\  is  of  asymptotic  density  zero.  For  the  purpose  of  this  section  only, 
we  omit  the  index  and  simply  write 

C  =  {1  <  n  :  2n+1  —  1  =  2J  (mod  n)  for  some  j  =  1,2,.. .}. 

It  is  clear  that  C  contains  only  odd  numbers.  Recall  that  for  a  positive  real  number  x  and 
a  set  A  we  put  A(x)  =  A  Fl  [1,  x\.  We  prove  the  following  estimate. 

Theorem  8.  The  estimate 

*C{x)  <<:  (log  log  .x)1/7 


holds  for  all  x  >  ee. 

Proof.  We  let  x  be  large,  and  put  q  for  the  smallest  prime  exceeding 

1  /  log  log  x  \  1//2 
^  2  \  log  log  log  x  J 

Clearly,  for  large  x  the  prime  q  is  odd  and  its  size  is  q  =  (1  +  o(l))y  as  x  — >  oo.  For 
an  odd  prime  p  we  write  tp  for  the  order  of  2  modulo  p  first  defined  at  the  beginning  of 
Section  3.  Recall  that  this  is  the  smallest  positive  integer  k  such  that  2k  =  1  (rnodp). 
Clearly,  tp  \  p  —  1 .  We  put 

V  =  {p  prime  :  p  =  1  (mod  q)  and  tp  \  (p  —  1  )/q}.  (6) 

The  effective  version  of  Lagarias  and  Odlyzko  of  Chebotarev’s  Density  Theorem  (see  [10],  or 
page  376  in  [14]),  shows  that  there  exist  absolute  constants  A  and  B  such  that  the  estimate 

*V(i)  =  CPT  +  °  (expf.^/,,))  (7) 
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holds  for  all  real  numbers  t  as  long  as  q  <  F>(logf)1//8.  In  particular,  we  see  that  estimate 
(7)  holds  when  x  >  xo  is  sufficiently  large  and  uniformly  in  t  £  [z,x],  where  we  take 
z  =  exp((loglogx)100). 


We  use  the  above  estimate  to  compute  the  sum  of  the  reciprocals  of  the  primes  p  £  V(u), 
where  we  put  u  =  x1/100.  We  have 


S 


Y. 

p&V(u) 


1 

P 


E^  +  E 

peP  p&V 

p<z  z<p<u 


1 

P 


5i  +  S2. 


For  ,S'i ,  we  only  use  the  fact  that  every  prime  p  &  V  is  congruent  to  1  modulo  q.  By  the 
Brun-Titchmarsh  inequality  we  have 


Si  < 


p=  i 


E 

P<Z 

(mod  q ) 


l 

-  < 
p 


log  log  z 

<Kq) 


< 


log  log  log  X 

q 


0(1). 


For  S2 ,  we  are  in  the  range  where  estimate  (7)  applies  so  by  Abel’s  summation  formula 


S2  =  E  -« 

z — s  r> 


d#V(t)  #P(t) 


pev 

z<p<u 


p 


t=u 


t=z 


+ 


ir(t)  (  t  1  Y 

q(q-l)t2+  \exp(A  v/log  t/q)J/ 


dt 


q(q-l)tlogt  \q2 


1  \  /  ru 

+  O  [  „  )  +0 


dt 


-  l)i(logt)2 


log  log  u  log  log  z  f  1  \  log  log  X 

q(q  1)  1  2  1  l 


d2/  ?(g-i) 


In  the  above  estimates,  we  used  the  fact  that 


At)  =  T7E7  +  0  1 


logf  v(los  t)2 )  ’ 


as  well  as  the  fact  that 


t 


=  O 


t 


exp(Av/logt/g)  Vt^logt)2 
uniformly  for  t  >  z.  To  summarize,  we  have  that 


s  =  loglogx,  +  0(1)  =  log  Y  +  o  f‘°gl°g:1'  + 1)  =  log  Y  +  0(1). 

q(q-l)  q2  \  <f  )  q2 


(8) 
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We  next  eliminate  a  few  primes  from  V  defined  in  (6).  Namely,  we  let 

Vi  =  {p  :  tp  <  p1/2/{logp)10}, 


V2  =  {p  '■  p  —  1  has  a  divisor  d  in  [p1,/2/(logp)10,p1/,2(logp)10]}. 

A  well-known  elementary  argument  (see,  for  example,  Lemma  4  in  [2])  shows  that 


#Vi  (t)  « 


(logt)2’ 


therefore  by  the  Abel  summation  formula  one  gets  easily  that 

£r0(1)- 

P&V  i 


As  for  V‘2,  results  of  Indlekofer  and  Timofeev  from  [9]  show  that 


#r2(t) « 


t  log  log  t 
(log  t)  1+5  ’ 


where  5  =  2  —  (1  +  In  In  2)/ In  2  =  0.08  . . .,  so  again  by  Abel’s  summation  formula  one  gets 


£ 

p£V2 


We  thus  arrive  at  the  conclusion  that  letting  Q  =  V\{V\  UV2).  we  have 


s'=  y  ~  =  s~  y 

/  qrv  Z. J 


l  =  loglog g 


p&Q{u) 


p£'Pl(u)U'P2(u)  ' 


Now  let  us  go  back  to  the  numbers  n  £  C.  Let  T>\  be  the  subset  of  C(x )  consisting  of  the 
numbers  free  of  primes  in  Q(u).  By  the  Brun  sieve, 


«  x  J]  =*exp  (-  £  l  +  0\  E  W 

P&Q(u)  '  \  p£Q(u)  \p£Q(u) 


<  xexp(-5'  +  0(l))<xexp^-1°^ir^ 


(log  log  x)A+°^)  (log  log  x)3 
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Assume  from  now  on  that  n  £  C(x)\D\.  Thus,  p  \  n  for  some  prime  p  £  Q(u) .  Assume  that 
p2  |  n  for  some  p  £  Q{u).  Denote  by  T>2  the  subset  of  such  n  £  C(x)\T> i.  Keeping  p  £  Q(u) 
fixed,  the  number  of  n  <  x  with  the  property  that  p2  \  n  is  <  x/p2.  Summing  up  now  over 
all  primes  p  =  1  (mod  q )  not  exceeding  x 1^2,  we  get  that  the  number  of  such  n  <  x  is  at 
most 


#^2< 


£ 

p<ad/2 
p=l  (mod  q ) 


p2  g2  log  q 


< 


X 


log  log  X 


(12) 


Let  V 3  =  C(x)\(L>i  U  P2)-  Write  n  =  pm,  where  p  does  not  divide  m.  We  may  also 
assume  that  n  >  x/  log  x  since  there  are  only  at  most  x /  log  x  positive  integers  n  failing  this 
condition.  Put  t  =  tp.  The  definition  of  C  implies  that 

2mp+ 1  =  2j  +  1  (mod  p) 

for  some  j  =  1,2  and  since  2P  =  2  (mod  p),  we  get  that  2mp+1  =  2m+1  (mod  p). 

We  note  that  2m+1  (mod  p)  determines  rn  <  x/p  uniquely  modulo  t.  We  estimate  the 
number  of  values  that  m  can  take  modulo  t.  Writing  X  =  {2J  (mod  p) },  we  see  that 
(mod  p)}  <  7/i,  where  I  is  the  number  of  solutions  (xi,X2,X2)  to  the  equation 

X\  —  X2  —  X3  =  0,  Xl,  X2,  X3  £  X.  (13) 

Indeed,  to  see  that,  note  that  if  m  and  j  are  such  that  2m+1  =  1  +  2-7  (mod  p),  then 
(xi,  X2-,  X3)  =  (2m+1+p,  2y,  2J+y)  for  y  =  0, . . . ,  t  —  1,  is  also  a  solution  of  equation  (13),  and 
conversely,  every  solution  (xi,X2,X3)  =  (2yi,  2y2, 2ys)  of  equation  (13)  arises  from  2m+1  = 
1  +  2J  (mod  p),  where  m  +  1  =  y\  —  r/2  and  j  =  P3  —  P2>  by  multiplying  it  with  2V2. 

To  estimate  /,  we  use  exponential  sums.  For  a  complex  number  2  put  e(z)  =  exp(27ri2:). 
Using  the  fact  that  for  z  £  {0, 1, ...  ,p  —  1}  the  sum 


-£e(-/?) 

is  1  if  and  only  if  z  =  0  and  is  0  otherwise,  we  get 


p- 1 

£  £  e(a(xi  -  x2  -  x3)/p). 
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Separating  the  term  for  a  =  0,  we  get 


r  (#X)3  1^4  v-  .  .  ,  .  ,  t3  1  ^4  ^  2 

I  =  - -  +  -  E  eia(xl  -  *2  -  ®3)/p)  =  —  +  -  E  TaTlai 

P  P  i  ^  v  P  P  , 

a=  1xi,X2,X3GA  a=l 


where  we  put  Ta  =  Sxiex  e(axi/p).  A  result  of  Heath-Brown  and  Konyagin  [8],  says  that 
if  a  /  0,  then 

|ra| « t3' V/A- 

Thus, 

/=  -  +  0(t9/V/4), 

P 

leading  to  the  fact  that  the  number  of  values  of  m  modulo  t  is 

J  f2 

#{m  (mod  t)}  <  -  < - 1-  0(tl^p3^). 

t  p 

Since  also  m  <  x/p,  it  follows  that  the  number  of  acceptable  values  for  m  is 


«  ^  (-  + (‘/y/d  «  ^  + 

pt  \p  Jp 


x 


t7/Spl/4 


(note  that  x/pt  >  1  because  pt  <  p2  <  v?  <  x).  Hence, 


*^<  E  S+  E 


peQ(u) 


pr 


peQ(u) 


tf/Zp1/4 


=  Tl  +  T2. 


For  the  first  sum  T\  above,  we  observe  that  t  <  p/q ,  therefore  t/p 2  <  l/(pq).  Thus,  the 
first  sum  above  is 


Ti«  e 


peQ(u) 


x  xS'  x  log  log  x  (log  log  log  x)3/2 
pq^  q  ^  q3  ^  (log  log  x)1/2 


(14) 


where  we  used  again  estimate  (10).  Finally,  for  the  second  sum  T2,  we  change  the  order  of 
summation  and  thus  get  that 

T2^E^8  E  mTI’  (15) 

t>to  p£Q(u) 

t{p)=t 
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where  to  =  to(q)  can  be  taken  to  be  any  lower  bound  on  the  smallest  t  =  tp  that  can  show 
up.  We  will  talk  about  it  later.  For  the  moment,  note  that  for  a  fixed  t,  p  is  a  prime  factor 
of  2*  —  1.  Thus,  there  are  only  O(logf)  such  primes.  Furthermore,  for  each  such  prime  we 
have  p  >  qt.  Hence, 


T2  < 


qV4  JL  ^9/8' 

1  t>t0 


Since  p  fL  V\  U  V2,  we  get  that  tp  >  p1/2(logp)10.  Since  p  >  2q  +  1,  we  get  that  t  S> 
q1/2 (log  q) 10 .  Thus,  for  large  x  we  may  take  to  =  ql^2(\ogq)9  and  get  an  upper  bound  for 
T>.  Hence, 


T2  < 


v  -  logt  f°°  log s 

“ 1/4  tWa  91/4  59/8  “ 


< 


X 

71/4 


log  s 


,1/8 


< 


< 


g1/2(iogq)9y  g1/4+1/16(logg)1/8  g5/16(log  g)1/8 


< 


x(loglog  logx)1/32 
(log  logx)5/32 


(16) 


Combining  the  bounds  (14)  and  (16),  we  get  that 

^  3  ^  (log  log  x)1/7 ' 

which  together  with  the  bounds  (11)  and  (12)  completes  the  proof  of  the  theorem.  □ 

Although  the  density  of  C\  is  zero,  one  my  try  to  calculate  the  densities  of  Cm  (m  > 
1)  hoping  that  they  are  positive  and  approach  1  as  m  ->  00.  In  the  Figure  2  we  have 
numerically  calculated  the  density  of  C2  within  the  odd  integers  up  to  63201.  Nevertheless, 
we  abandoned  this  idea  having  conjectured  that  the  density  of  each  Cm  is  still  zero.  However, 
the  next  section  gives  a  way  out  to  proving  that  q,/ 2k  goes  to  zero  in  arithmetic  average 
over  odd  integers  k. 


6  The  sets  Cm  for  large  m 

In  this  section,  we  prove  the  following  result. 

Theorem  9.  Put  m(k )  =  Lexp(4000(logloglog  A:)3)J .  The  set  of  odd  positive  integers  k 
such  that  k  £  Cm (*.)  is  of  asymptotic  density  1/2. 
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Figure  2:  The  graph  of  2^|^,  1  <  x  <  63201,  x  odd. 


In  particular,  most  odd  positive  integers  k  belong  to  Cm^y 


Proof.  Let  x  be  large.  We  put 

V  =  (log  log  x)3. 

We  start  by  discarding  some  of  the  odd  positive  integers  k  <  x.  We  start  with 

Ai  =  {k  <  x  :  q2  \  k,  or  q(q  —  1)  |  k,  or  q2  \  cj)(k )  for  some  prime  q  >  y}. 

Clearly,  if  n  G  A\,  then  there  exists  some  prime  q  >  y  such  that  either  q2  \  n,  or  q(q  —  1)  |  n, 
or  q2  |  p  —  1  for  some  prime  factor  p  of  n,  or  n  is  a  multiple  of  two  primes  p\  <  p2  such  that 
q  |  pi  —  1  for  both  i  =  1  and  2.  The  number  of  integers  in  the  first  category  is 


s  E 

y<g<ai1/2 


<  X 


E 


,1/2 


y<q<x1/2 


dt  x 

72  «  -  = 

t 2  y 


(log  log  X )3 


o(x) 


as  x  — >•  oo.  Similarly,  the  number  of  integers  in  the  second  category  is 


s  E 

y<q<x1/2+ 1 


X 

_q{Q~ !). 


^ — -v  1  X 

<  X  >  -TT  <  - 

' '  qz  y 

y^q^x1/2- i-i 


g  /  \ 

(log  log  x) 3 
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as  x  — >  oo.  The  number  of  integers  in  the  third  category  is 


s  £  £ 

y<g,<x1/2  P<x 

p=  1  (mod  q2) 


X 

P 


<x  y 

y<q<x1/2 

P=  1 


£ 

p<x 

(mod 


1 

p 


q2) 


Elog  log  X  ,  . 

—  0  <C  x  log  log  x 

<M<r) 


y<qi£x 1/2 


£ 

y<q<x 1/2 


< 


x  log  log  X 

y 


X 

(log  log  x)  2 


as  x  — >  oo,  while  the  number  of  integers  in  the  fourth  and  most  numerous  category  is 


< 


E  E 


y<q<xx!'1  pi<P2<x 

Pi= 1  (mod  q),  i=l,2 


PlP2. 


E  E 

y<q<x1/'2  pi<P2<x 

Pi  =  1  (mod  q),  i=l,2 


<X 


y<q<x^l2 


£ 


P 

\  P<x  I 

\p=l  (mod  q)  / 


<X  ^ 

y<q<x1^2 


( log  log  X 

V  <t>{q) 


n  i  n2  1  X  log  log  X  2  X 

<C  x(loglogx)2  >  -  — 

qz  y 

y<q<x^-l2 


log  log  X 


=  o(x) 


as  x  — >  oo.  We  now  let 

Q  =  {p  :  tp  <p1/3}, 

and  let  be  the  set  of  k  <  x  divisible  by  some  q  e  Q  with  q  >  y.  To  estimate  we 

begin  by  estimating  the  counting  function  #Q(t)  of  Q  for  positive  real  numbers  t.  Clearly, 

2#S(t)  <  JJ  q<  JJ  (2s  -  1)  <  2E-<*1/3S  <  2*2/3, 
q£Q(t)  s^t1/3 


SO 

#Q(t)  <  t2/3.  (17) 

By  Abel’s  summation  formula,  we  now  get  that 


#a2  <  y 


y<q<x 

q&Q 


X 

q 


2/<g,<x 

qeQ 


i 

-«x 

q 


d#m 

t, 


< 


x 

£73 


X 

log  log  X 


o(x) 


16 


as  x  — >  oo. 

Recall  now  that  P(m)  stands  for  the  largest  prime  factor  of  the  positive  integer  m. 
Known  results  from  the  theory  of  distribution  of  smooth  numbers  show  that  uniformly  for 
3  <  s  <  t,  we  have 


^(t,  s )  =  #{m  <  t  :  P(m)  <  s}  <C  texp(— u/2),  (18) 

where  u  =  logi/logs  (see  [15,  Section  III.4]).  Thus,  putting 

z  =  exp  (32(logloglogx)2)  , 


we  conclude  that  the  estimate 


t 

(log  log  x) 5 


(19) 


holds  uniformly  for  large  x  once  t  >  z,  because  in  this  case  u  =  ^  log  log  log  x, 

therefore 

u  16 

2  >  y  !og  log  log  ah 

so,  in  particular,  u/2  >  5  log  log  log  x  holds  for  all  large  x.  Furthermore,  if  t  >  Z  = 
exp  ( (log  log  x)2),  then 

log  t  (log  log  x) 2 

log  y  3  log  log  log  x  ’ 

so  u/2  >  2  log  log  x  one  x  is  sufficiently  large.  Thus,  in  this  range,  inequality  (19)  can  be 
improved  to 

*(*,£/)«  (0*  - v  «  7T~^2'  (2°) 

exp(2  log  log  x)  (log  x)z 

Now  for  a  positive  integer  m,  we  put  d(m,  y )  for  the  largest  divisor  d  of  m  which  is  y-smooth, 
that  is,  P(d)  <  y.  Let  ^3  be  the  set  of  k  <  x  having  a  prime  factor  p  exceeding  z10  such 
that  d(p  —  1,  y)  >  p1/10.  To  estimate  #^3,  we  fix  a  y-smooth  number  d  and  a  prime  p  with 
z10  <  p  <  d 10  such  that  p  =  1  (mod  d),  and  observe  that  the  number  of  n  <  x  which  are 
multiples  of  this  prime  p  is  <  \_x/p\.  Note  also  that  d  >  p1/10  >  z.  Summing  up  over  all 
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the  possibilities  for  d  and  p.  we  get  that  does  not  exceed 


z<d  p<x 

P(d)<yp=l  (mod  d) 


1 


-  < x  -«i 

2-^j  /  j  n  "  2—j  2-^  n  1  2 -j 


log  log  X 


z<d  p<x  P  z<d 

P{d)<y  p=  1  (mod  d)  P(d)<y 


<C  x(loglogx)2  ^2  ^  <C  x(loglog.x)2  [ 

~  ^  Pi  J  z 


2  fxd*(t,y) 


z<d 

P(d)<y 


<C  x(log  log  x 


2  ,  V(t,y)  i*=® 


®(*,J/) 


*=*  Jz  t2 


dt 


<<(idgi^+x(loglc,gx)  /  — 


In  the  above  estimates,  we  used  aside  from  the  Abel  summation  formula  and  inequality  (19), 
also  the  minimal  order  of  the  Euler  function  <j>(d)/d  3>  1/ log  log  x  valid  for  all  d  G  [l,x].  It 
remains  to  bound  the  above  integral.  For  this,  we  split  it  at  Z  and  use  estimates  (19)  and 
(20).  In  the  smaller  range,  we  have  that 


y{t,y)dt 

t 2 


< 


dt  log  Z  1 

—  <  71 - W - CcT  < 


(log  log  ,x)5  Jz  t  (log  log  x)5  (log  log  x)3 


In  the  larger  range,  we  use  estimate  (20)  and  get 


Jz  t2 

Putting  these  together  we  get  that 


^( t,y)dt  ^  1 


dt  1 

—  < 


(log  x)2  Jz  t  log  X 


#A3  « 


x  .2  I  1 


(log  logx)3 


+  x(log  logic) 


+ 


(log  log  1c)3  logic 


=  o(x) 


as  x  — >  00. 


Now  let  t  =  d(k,  z10).  Put 


w  =  exp(1920(log  log  logic)3), 


and  put  A4  for  the  set  of  k  <  x  such  that  £  >  w.  Note  that  each  such  k  has  a  divisor  d  >  w 
such  that  P(d)  <  z10.  Since  for  such  d  we  have 

log  d 


log  (z 


10' 


=  6  log  log  log  x, 
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we  get  that  in  the  range  t>w,  u/2  >  3  log  log  log  x,  for  large  x,  so 

<  (log  logx)3 

uniformly  for  such  t  once  x  is  large.  Furthermore,  if 


(21) 


t  >  Z\  =  exp(1280  log  log  x(log  log  logx)2), 
then  u  =  ^Jo  >  4  log  log  x  therefore  u/2  >  2  log  log  x.  In  particular, 

10)<<o^ w  <22) 

in  this  range.  By  an  argument  already  used  previously,  we  have  that  j/A^  is  at  most 


< 


E 

w<d<x 

P(d)<z10 


X 

~d\ 


i  r  dy(t,  z 

<x  -  «  x  /  — V 


ION 


w<d<x 

P(d)<z10 


<C  x 


tt( t.,zw )  t=x  r *  y(t,z10)dt 

t  t=w  A 


<C  x 


<C  x 


1 


(log  log  x) 3 

1 


+ 


rZl  y{t,zw)dt,  +  rx  ^{t,z10)dt 


t‘ 2 


lz  1 


log  Zi  log  X 

1/1  i  \  Q  I 


(log  log  x)3  (log  log  x)3  (logx)2 


t2 


=  o(x) 


as  x  — >  oo,  where  the  above  integral  was  estimated  by  splitting  it  at  Z\  and  using  estimates 
(21)  and  (22)  for  the  lower  and  upper  ranges  respectively. 


Let  Ao  be  the  set  of  k  <  x  which  are  coprime  to  all  primes  p  €  [y,z10].  By  the  Brun 
method, 


#a5  <  x 

y<q<z 


1 

1  -  - 
q 


x  log  u 


x 


log  z  log  log  log  X 


=  o(x) 


as  x  — >  oo. 


We  next  let  be  the  set  of  k  <  x  such  that  P(k)  <  re100.  Clearly. 


#Aq  =  ^(x,?/;100)  =  xexp  I  —c\ 


logx 


(log  log  logx)3 


=  o(x) 


19 


as  x  — >  oo,  where  c\  =  1/384000. 


Finally,  we  let 

Ar  =  {k  <  x  :  dp  \  k  for  some  p  =  1  (mod  d)  and  p  <  d3}. 

Assume  that  k  G  A7.  Then  there  is  a  prime  factor  p  of  k  and  a  divisor  d  of  p  —  1  of  size 
d  >  p1/3  such  that  dp  \  k.  Fixing  d  and  p,  the  number  of  such  n  <  x  is  <  [x/(dp)\.  Thus, 


#A7  < 


E  E 


y<p<x  d\p-l 
d>p V3 


X 

dp 


<X  y 


i 

p 


E 

d\p-l 

d>p1/3 


1 

d 


< 


rjp  -  1) 

pi/3 


Ap  -  1) 
pl+l/3 


y<p<x 


i 

p5/4 


[  dt  X  X 

<  x  /  -J-7T  <  — 77  =  - - - - 7577  =  o(x 

Jy  t5 A  y1/4  (log  log  x)3/4 


as  x  — >  oo.  Here,  we  used  r(m)  for  the  number  of  divisors  of  the  positive  integer  m  and  the 
fact  that  r(m)  <C£  me  holds  for  all  e  >  0  (with  the  choice  of  s  =  1/12). 

From  now  on,  k  <  x  is  odd  and  not  in  Ui<j<7^i-  From  what  we  have  seen  above,  most 
odd  integers  k  below  x  have  this  property.  Then  t  <  w  because  k  0  A4.  Further,  k/i  is 
square- free  because  k  $  Ai-  Moreover,  if  p  \  k/£,  then  p  >  z10  >  y,  therefore  tp  >  p1/3 
because  k  $  A-2-  Since  k  0  A3,  we  get  that  d(p  —  1  ,y)  <  p1/10,  so  t'p  =  tp/  gcd (tp,d(p  — 
1  ,y))  >  p1/3-1/10  >  p1/5  for  all  such  p.  Moreover,  tp  is  divisible  only  by  primes  >  z  >  y,  so 
if  pi  and  P2  are  distinct  primes  dividing  k/£,  then  tpi  and  t'p2  are  coprime  because  k  0  Ai. 
Finally,  £  >  y  because  k  0  A5.  Furthermore,  for  large  x  we  have  that  w  >  y,  so  k  >  £  and 
in  fact  k/£  is  divisible  by  a  prime  >  u;100  because  k  ^  Aq. 

We  next  put  n  =  lcm[d((j)(k) ,  y).  (j>{£)\.  We  let  no  stand  for  the  minimal  positive  integer 
such  that  no  =  —k  +  1  (mod  <j>(£))  and  let  m  =  uq  +  £cj>(£).  Note  that 

in  <  2 £(/)(£)  <  2 w2  =  2exp(3840(logloglogx)3). 

We  may  also  assume  that  k  >  x/logx  since  there  are  only  at  most  x/logx  =  o(x)  positive 
integers  k  failing  this  property.  Since  k  >  x/logx,  we  get  that 

m  <  2  exp(3840(logloglogx)3)  <  |_exp(4000(logloglog  k)3)\  =  m(k ) 
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holds  for  large  x.  We  will  now  show  that  this  value  for  m  works.  First  of  all  m  +  k  = 
no  +  £(f>(£)  +  k  =  1  (mod  4>(l))  so 

2 m+k  _  i  =  i  =  2^W_1  +  2^-2  +  •  •  •  +  +  2^-(no~1',+ 

+  2Xin  +  ■  ■  •  +  2Xtn  (modi), 


where  t  =  1(f)(1)  and  x\ , ,xt  are  any  nonnegative  integers.  Let 

JJ  _  2 m+k  _  l  _  2 ^W-1  _  .  .  .  _  2^W_(n0_1)  —  2^W_(no_1l 


Then 

t 

u=j2 2Xi4>(e)  (m°d  f) 

i— 1 


for  any  choice  of  the  integers  x\,...,xt-  Let  p  be  any  prime  divisor  of  k/£.  Clearly, 
gcd(tp,n)  =  d(tp,y),  because  tp  \  4>(k)  and  n  0  A\.  In  particular, 


tp  gcd  (tp,n) 


> 


gcd(d(0(A:),y),p-  1) 


>  pl/3-1/10  >  pl/5 


Let  X  =  {2jn  (mod  p)}.  Certainly,  the  order  of  2”  modulo  p  is  precisely  t’p.  So,  =  t'p  > 
p 1/5.  A  recent  result  of  Bourgain,  Glibichuk  and  Konyagin  (see  Theorem  5  in  [1]),  shows 
that  there  exists  a  constant  T  which  is  absolute  such  that  for  all  integers  A,  the  equation 


A  =  2xin  +  •  •  •  +  2Xtn  (mod  p) 


has  an  integer  solutions  0  <  x\, ...  ,xt  <  t'p  once  t  >  T.  In  fact,  for  large  p  the  number  of 
such  solutions 

N(t,p,  A)  =  #{(xi,---,xt)  :  0  <  xi,...,xt  <  tp} 


satisfies 


N(t,p,  A)  G 


'#X*  2#Xr 

2  p  ’  p 


independently  in  the  parameter  A  and  uniformly  in  the  number  t.  In  particular,  if  we  let 
N\(t,p,  A)  be  the  number  of  such  solutions  with  xt  =  Xj  for  some  i  /  j ,  then  N\(t,p ,  A)  <C 
Indeed,  the  pair  (i,  j)  with  i  =/=■  j  can  be  chosen  in  0(t 2)  ways,  and  the  common 
value  of  Xi  =  Xj  can  be  chosen  in  ^X  ways.  Once  these  two  data  are  chosen,  then  the 
number  of  ways  of  choosing  xs  G  {0, 1, . . . ,  t'p  —  1}  with  sG  {1,2,...,  j}  such  that 

A  _  2Xin  _  2Xjn  =  2Xan  (mod  p) 

1  <s<t 

s^i,j 
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is  N(t  —  2,p,  A  —  2Xin  —  2xin)  <C  #Xt_2/p  for  t  >  T  +  2.  In  conclusion,  if  all  solutions 
x\, . . . ,  xt  have  two  components  equal,  then  p1/-5  <C  #A  <C  f2,  so  p  <C  t10.  For  us,  t,  <  2 w2, 
so  p  <  re20.  Since  P(k)  =  P(k/£ )  >  re100,  it  follows  that  at  least  for  the  largest  prime 
factor  p  =  P(k)  of  k,  we  may  assume  that  x\ , ,xt  are  all  distinct  modulo  p  for  a  suitable 
value  of  A. 

We  apply  the  above  result  with  A  =  U,  t  =  £(/)(£)  (note  that  since  t  >  y,  it  follows  that 
t  >  T  +  2  does  indeed  hold  for  large  values  of  x),  and  write  x(p)  =  (aq(p), . . . ,  xt(p))  for  a 
solution  of 

u  =  2xi(p)n  +  ...  +  2xt(p)n  (modp);  0<xi(p)<...<xt(p)<^. 

We  also  assume  that  for  at  least  one  prime  (namely  the  largest  one)  the  Xj(p)’ s  are  distinct. 
Now  choose  integers  x\ , ,xt  such  that 

Xi  =  Xi(p)  (mod  t'p) 

for  all  p  |  k/£.  This  is  possible  by  the  Chinese  Remainder  Lemma  since  the  numbers  t'p 
are  coprime  as  p  varies  over  the  distinct  prime  factors  of  k/£.  We  assume  that  for  each  i, 
Xi  is  the  minimal  nonnegative  integer  in  the  corresponding  arithmetic  progression  modulo 
Yip\k/e^p-  Further,  since  nxi(p)  are  distinct  modulo  t’p  when  p  =  P(k),  it  follows  that  nxi 
are  also  distinct  for  i  =  1  Hence,  for  such  x\  s  we  have  that  U  —  Yl\= 12Xin  is  a 

multiple  of  all  p  \  k/£,  and  since  k/£  is  square-free,  we  get  that  U  =  X^i=i  2Xin  (mod  k/£). 
But  the  above  congruence  is  also  valid  modulo  £,  so  it  is  valid  modulo  lcrn {£.  k/£\  =  k,  since 
£  and  k/£  are  coprime.  Thus, 


t. 

U  =  J2  2Xin  (mod  k), 
i=l 


2k+m-i  _  ^  =  2^W_1  2^)-(no_1)  -f-  2^W_(no_1)  _)_  E  2x‘n  (mod  k). 

i= 1 

As  we  have  said,  the  numbers  Xin  are  distinct  and  they  can  be  chosen  of  sizes  at  most 
n  lem [tp  :  p  \  k£\  <  <f>(k)  <  k.  Finally,  nxi  are  divisible  by  (j>(£)  whereas  none  of  the  numbers 
4>(£)  —  j  for  j  =  1, . . . ,  no  —  1  is  unless  no  =  1.  Thus,  assuming  that  71q  /  1,  we  get  that  all 
the  m  =  t  +  no  exponents  are  distinct  except  for  the  fact  that  (j>(£)  —  (no  —  1)  appears  twice. 
Let  us  first  justify  that  no  ^  1.  Recalling  the  definition  of  no,  we  get  that  if  this  were  so  then 
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<f(£)  |  k.  However,  we  have  just  said  that  l  has  a  prime  factor  p  >  y.  If  cf(£)  |  k,  then  k  is 
divisible  by  both  p  and  p—  1  for  some  p  >  y  and  this  is  impossible  since  n  fL  A\.  Finally,  to 
deal  with  the  repetition  of  the  exponent  f>(£)  —  (no  —  1),  we  replace  this  by  <f(£)  —  (no  —  1  )+tk, 
where  as  usual  tk  is  the  order  of  2  modulo  n.  We  show  that  with  this  replacement,  all  the 
exponents  are  distinct.  Indeed,  this  replacement  will  not  change  the  value  of  1  )+4 

(mod  k).  Assume  that  after  this  replacement,  <f(£)  —  (no  —  1)  +tk  is  still  one  of  the  remaining 
exponents.  If  it  has  become  a  multiple  of  n,  it  follows  that  it  is  in  particular  divisible  by  tp  for 
all  primes  p  \  £.  Since  tp  \  tk  and  tp  \  cj>(£)  for  all  primes  p  \  £,  we  get  that  tp  |  no  —  1,  so  tp  \  k. 
Since  £  is  divisible  by  some  prime  p  >  y  (because  k  0  A&),  we  get  that  tp  \  k.  Since  k  0  A2, 
we  get  that  tp  >  plA .  Thus,  k  is  divisible  by  a  prime  p  >  y  and  a  divisor  d  of  p  —  1  with 
d  >  p 1/3,  and  this  is  false  since  n  0  A7.  Hence,  this  is  impossible,  so  it  must  be  the  case  that 
<j>(£)  —  (no— l)+tfc  G  {(/>(£)  —  1,  •  •  • ,  <])(£)  — (no  — 1)}.  This  shows  that  tk  <  no  <  £$(£)  <  210w20. 
However,  tk  is  a  multiple  of  tp(k)  —  P(k)1^3,  showing  that  P(k)  <  230w60,  which  is  false 
for  large  x  since  k  0  Ae-  Thus,  the  new  exponents  are  all  distinct  for  our  values  of  k.  As 
far  as  their  sizes  go,  note  that  since  k  has  at  least  two  odd  prime  factors,  it  follows  that 
tk  |  <l>(k)/ 2,  therefore  <f>(£)  —  (no  —  1)  +  tk  <  w  +  4>(k)/ 2  <  w  +  k/2  <  k  since  k  >  2w  for 
large  x.  Thus,  we  have  obtained  a  representation  of  2k+Tn  —  1  modulo  m  of  the  form 

2J1  +  •  •  •  +  2-7m  (mod  k ) 

where  0  <  j\  <  . . .  <  jm  <  k,  which  shows  that  k  G  Cm.  Since  m  <  m(k)  and  Cm  C  Cmn^, 
the  conclusion  follows.  □ 

Remark  10.  The  above  proof  shows  that  in  fact  the  number  of  odd  k  <  x  such  that  k  fL  Cmik\ 
is  0(:r/ log  log  log  x). 

Corollary  11.  For  large  x,  the  inequality  ck/2k  <  2m(fc)  (losfe)/(log2)  holds  for  all  odd  k  <  x 
with  at  most  0(x/logloglogx)  exceptions. 

Proof.  This  follows  from  the  fact  that  ck  =  ak/k  <  2k+m/k,  where  k  G  Crn  (see  Theorem 
6),  together  with  above  Theorem  9  and  Remark  10.  □ 

Corollary  12.  The  estimate 


1 

x 


ECk 
2k 

1  <k<x 
k  odd 


o 


1 


log  log  log  X 


holds  for  all  x. 
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X 


Proof.  If  k  <  x /  log  x  is  odd,  then  c*,/ 2k  <  1,  so 


E  -< 


k<x/  log  x 
k  odd 


2k  log : 


If  k  6  [x/\ogx,x\  but  k  0  Cmrk\,  then  still  Ck/2k  <  1  and,  by  the  Corollary  11,  the  number 
of  such  fc’s  is  0(x/logloglogx).  Thus, 


E  |« 

k£[x/  log#,#] 
k  odd 


X 

log  log  log  X 


For  the  remaining  odd  values  of  k  <  x,  we  have  that 

<  2m(fc)-(1°gfc)/(1°g2) 
2k  ~  ’ 

so  it  suffices  to  show  that 


is  equivalent  to 


2«i(fc)  —  (log  k)/ (log  2)  <  1 

log  log  log  X  ’ 


(log  fc) / (log  2)  —  m(k)  >  log  log  log  log  x/  log  2, 
which  in  turn  is  implied  by 

log(a;/ log  x)  —  log  log  log  log  x  >  (log  2)  exp(4000(logloglogx)3), 
and  this  is  certainly  true  for  large  x.  Thus,  indeed, 


ECfc 
2k 

1  <k<x 
k  odd 


o 


log  log  log  x 


which  is  what  we  wanted  to  prove. 


□ 


In  particular, 


l  E  |  =  °(D  (23) 

k<x 
k  odd 

as  x  — >  oo.  One  can  adapt  these  techniques  to  obtain  that  the  whole  sequence  Ck/2k  is 
convergent  to  0  in  arithmetic  average.  In  order  to  do  so,  the  sets  Cm  should  be  suitably 
modified  and  an  analog  of  Theorem  9  for  these  new  sets  should  be  proved.  We  leave  this 
for  a  subsequent  work. 
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7  Existence  and  bounds  for  in  base  q  >  2 


Let  q  >  2  be  a  fixed  integer  and  let  x  be  a  positive  real  number.  Put 

Vk(x)  =  {0  <  n  <  x  :  sq(n)  =  k}, 

Vk{x\  h,m)  =  {0  <  n  <  x  :  sq(n)  =  k,n  =  h  (mod  m)}. 

Mauduit  and  Sarkozy  proved  in  [12]  that  if  gcd (m,q(q  —  1))  =  1,  then  there  exists  some 
constant  Co  depending  on  q  such  that  if  we  put 

i  =  min  {A’,  (q  —  1) [logo;/ log gj  —  k}  , 

then  Vk(x)  is  well  distributed  in  residues  classes  modulo  m  provided  that  m  <  exp(co^1//2). 
Taking  m  =  k  and  h  =  0,  we  deduce  that  if  k  <  exp(co£1^2),  then 

Vk{x\  0,  k )  =  (1  +  o(l))Vk(x)/k 

as  x  — >  oo  uniformly  in  our  range  for  k.  The  condition  on  k  is  equivalent  to  log  A;  <C  A1/2, 
which  is  implied  by  k  +  0((logfc)2)  <C  logx.  Thus,  we  have  the  following  result. 

Lemma  13.  Let  q  >  2  be  fixed.  There  exists  a  constant  c\  such  that  if  k  is  any  positive 
integer  with  gcd (k,q(q  —  1))  =  1,  then  Vk(x)  is  well  distributed  in  arithmetic  progressions 
of  modulus  k  whenever  x  >  exp(ci  k) . 

Corollary  2  of  [12]  implies  that  if 


A  = 


q- 1 


21ogg 


log  x  —  k 


=  o(log x)  as  x  — >  oo, 


(24) 


then  the  estimate 


Wk(x)  = 


X 


(logx)1/2 


exp 


A2 

"Qs  i - O 

logx 


A3 


+ 


(logx)2  (logx)1/2 


holds  with  some  explicit  constant  C3  depending  on  q.  As  a  corollary  of  this  result,  we  deduce 
the  following  result. 

Lemma  14.  If  condition  (24)  is  satisfied,  then  Vk{x)  0. 

In  case  k  and  q  are  cop  rime  but  k  and  q  —  1  are  not,  we  may  apply  instead  Theorem  B 
of  [11]  with  m=  k  and  h  =  0  to  arrive  at  a  similar  result. 
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Lemma  15.  Assume  that  q  >  2  is  fixed.  There  exists  a  constant  C4  depending  only  on  q 
such  that  if  k  is  a  positive  integer  with  gcd (k,  q)  =  1,  and  x  >  exp(c4 k),  then  Vk(x ;  0,  k )  0. 


One  can  even  remove  the  coprimality  condition  on  q  and  k.  Assume  that  x  is  sufficiently 
large  such  that 

A  <  c5(logx)5/8,  (25) 

where  C5  is  some  suitable  constant  depending  on  q.  Using  Theorem  C  and  Lemma  5  of  [11] 
with  m  =  k  and  h  =  0,  we  obtain  the  following  result. 

Lemma  16.  Assume  that  both  estimates  (25)  and  k  <  20oga:)1/4  hold.  Then  T4(x;  0,  k)  0. 


A  sufficient  condition  on  x  for  Lemma  16  above  to  hold  is  that  x  >  exp^c^k),  where  cq  is 
a  constant  is  a  constant  that  depends  on  q.  Putting  Lemmas  15  and  16  together  we  obtain 
the  next  theorem. 

Theorem  17.  For  all  q  >  2  there  exists  a  constant  cq  depending  on  q  such  that  for  all 
k  >  1  there  exists  n  <  exp(cek)  with  sq(kn)  =  k. 


Consequently,  ak  =  exp(0(fc))  for  all  k,  and  in  particular  it  is  nonzero.  The  following 
example  of  Lemma  18  shows  that  ak  =  exp (o(k))  does  not  always  hold  as  k  — >  00. 


Lemma  18.  If  q  >  2,  then 


n  —  „m 

(Iqtn  —  q 


If  q  =  2,  then  02™  =  2m(22"1  —  1). 

Proof.  The  fact  that  sq(aqm )  =  qm  for  all  q  >  2  is  immediate.  We  now  show  the  minimality 
of  the  given  aq™  with  this  property.  Let  arn  =  ( qm  —  l)/(q  —  1).  Note  that  every  digit  of 
qfim  —  1  in  base  q  is  maximal,  so  g“m  —  1  is  minimal  such  that  sq(qam  —  1)  =  qm  —  1.  Since 

qam  -  1  =  (q  —  1  )qQm~1  +  (q  -  1  )qam~2  H - f  (q  -  1), 

then  aqm  must  contain  the  least  term  g*,  where  t  >  am  —  1  such  that  its  sum  of  digits  is 
qm  and  qm\aqm.  The  least  term  is  obviously  g“m,  and  it  just  happens  that  aqm  such  defined 
satisfies  the  mentioned  conditions.  □ 
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